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Abstract 

To combat the detrimental effects of the variability in wireless channels, we consider cross-layer rate 
adaptation based on limited feedback. Based on limited feedback in the form of link-layer Automatic 
Repeat-reQuest (ARQ), we maximize the physical-layer transmission rate subject to an upper bound on 
the expected packet error rate without assuming any particular prior distribution on the channel state. We 
first analyze the fundamental limitations of such systems and derive an upper bound on the achievable 
rate for signalling schemes based on uncoded QAM and random Gaussian ensembles. We show that, 
for channel estimation based on binary ARQ feedback, it may be preferable to use a separate training 
sequence at high error rates, rather than to exploit low-error-rate data packets themselves. We also develop 
an adaptive recursive estimator, which is provably asymptotically optimal and asymptotically efficient. 

Index Terms — adaptive modulation, rate adaptation, automatic repeat request, cross-layer strategies. 



I. Introduction 

The quality of the wireless communication channel depends on many different parameters and can vary 
significantly over time. Potential sources of channel variability include multipath propagation, mobility, 
and time-varying co-channel interference. These sources cause variabilities spanning a wide range of time 
scales, e.g., from bits to thousands of packets. For instance, relative movement of the transmitter-receiver 
pair may cause variations at relatively long time scales, since a very large number of packets can be 
transmitted during the time it takes for the stations to move far enough to cause significant change in the 
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channel. On the other hand, co-channel interference can change significantly from one packet transmission 
to another. Finally, the multipath nature of the propagation medium can cause fast and/or slow fading in 
the channel, depending on the relative movement of the scatterers. 

We propose physical-layer rate adaptation as a means of countering the detrimental effects of channel 
variability (as in, e.g., [1]— [15]). When the transmitter has perfect channel state information (CSI), rate 
adaptation is relatively straightforward. In practice, however, maintaining accurate CSI at the transmitter 
is highly nontrivial. One approach to obtaining CSI is via the inclusion of pilot symbols (as in, e.g., [1]- 
[4]). In such systems, the receiver's channel estimates (or some quantized representations thereof) are 
fed back to the transmitter for use in rate adaptation. Because the inclusion of pilots and non-trivial 
feedback consumes bandwidth on both the forward and reverse links, we are motivated to consider other 
approaches that reduce this resource consumption. In this paper, we focus on rate adaptation schemes that 
infer the channel state by monitoring only packet acknowledgments (ACK/NACKs), i.e., the feedback 
information used for automatic repeat request (ARQ) (as in, e.g., [5]— [15]), and that adjust the physical- 
layer transmission rate accordingly. Since ARQ feedback is a standard provision of the link layer, its use 
by the physical layer comes essentially "for free." 

ARQ systems rely on the presence of error detection for each packet received. When an error is 
detected in a received packet, a request is sent to the transmitter to retransmit the packet. Hence, a given 
packet may be repeated multiple times until it is decoded correctly. While this transmission process is 
sometimes considered to be "adaptive," in that the effective coding rate applied to the bits in a given 
packet varies with the number of retransmissions, we do not consider such standard-ARQ schemes as 
adaptive per se because the transmission rate per packet transmission may not change. In our work, we 
vary the transmission rate used at each packet interval according to the transmitter's ARQ-based estimate 
of the CSI. 

In particular, in this paper, we first derive conditions on the "quality" of CSI needed for a model- 
independent ARQ-based rate adaptation system to maximize data rate while keeping the packet error 
probability below a specified threshold. Based on these conditions, we derive fundamental bounds on 
the rate achievable under a given error probability constraint. Finally, we design an ARQ-feedback-based 
recursive CSI estimator with provable asymptotic optimality. Our findings are illustrated through both 
uncoded QAM and random Gaussian signaling. 

Past works on physical-layer rate adaptation took, for the most part, one of two different approaches. 
In the first approach (e.g., [2]-[4]), a specific channel model is assumed and an adaptive coding scheme 
is optimized around this channel model. In the second approach (e.g., [5]-[8]), adaptation algorithms 
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Fig. 1. The rate adaptation system. 



are designed heuristically, based on practical experiences gained for a specific application in a specific 
operating environment. From our viewpoint, neither of these approaches is fully satisfying, because the 
nature of channel variability is highly dependent on the environment, the platform, and the application. 
Furthermore, the characteristics of channel variability can fluctuate even under a fixed combination of 
environment, platform, and application. Consequently, it is difficult to capture channel variability using a 
single statistical model. The methods that we propose here strive for both statistical-model independence 
and optimality. In this sense, they differ from both of the previously described approaches to physical-layer 
rate adaptation. 

The remainder of the paper is organized as follows. In Section [TTJ we detail the system model and 
provide a mathematical statement of the problem. In Section [Till we derive conditions for successful 
rate adaptation with imperfect CSI, and in Section ITVl we evaluate bounds on the achievable rates with 
ARQ feedback. In Section |Vj we develop an recursive channel estimator based on such feedback, and in 
Section [VT] we conclude. 

II. System Model 

A. System Components 

Figure Q] depicts our model of the physical-layer adaptive communication system. At each discrete 
packet index t, the transmitter transmits a packet Xj = [X tj ±, . . . , Xt >n ] containing a fixed number, n, of 
symbols {-X* fc}5jf =1 , which are encoded at a rate of R t bits/symbol, chosen by the rate controller from 
the set of possible rates 1Z. We assume that the transmit power is constant and normalize all power levels 
such that the energy per symbol is E [jX^) 2 ] = 1. For this packet, the corresponding channel outputs 
are 

Y t>k = H t X t)h + W t ,k, k = l,...,n, (1) 

December 30, 2009 DRAFT 



4 



for complex-valued channel gain H t and additive white circularly symmetric complex Gaussian noise 
Wt,k with two-sided power spectral density N . Some common models for Ht include Rayleigh-, Rician- 
and Nakagami-fading (see e.g., [16]). However, we will not assume any specific statistical model for H t 
and we will make only weak assumptions on the distribution of H t in the sequel. 

The quantity jt — \Ht\ 2 /N a can be interpreted as the t th packet's channel SNR. Since each symbol 
has unit energy, j t is also the received SNR for packet t. Thus, we will simply refer to jt as the SNR. 
We assume that, for all t, takes on values from some prior distribution p(-) € V, where V is a set 
of distributions with finite mean and variance. For the time being, we assume the distribution p(-) is 
known to both the receiver and the transmitter, though later we relax this assumption and consider a 
non-Bayesian framework. Due to lack of power adaptation, j t is an exogenous quantity over which the 
system has no control. 

We assume that the receiver has access to perfect CSI and uses a maximum likelihood decoder to decode 
the received packet. Let X t denote the decoded estimate of packet X t based on received packet Y t = 
[Y^i, . . . , Yt >n ], and the corresponding probability of decoding error be 2(7*, Rt) = Pr(X t 7^ X t | j t , R t ). 
Note that e(-, •) depends on the packet size n and the coding/modulation schemes, which are assumed 
to be known at the decoder. For now, we assume only that the coding/modulation schemes are such that 
e(jt, Rt) is a convex, continuous, and increasing function of R t and a convex, continuous, and decreasing 
function of jf Later, we detail the behavior of our proposed schemes for the specific cases of uncoded 
QAM and random Gaussian signaling. 

Based on the received packet Y t and the decoded packet X t , the decoder generates a feedback packet 
F t which is communicated to the transmitter through a reverse channel. Assuming that the receiver is 
capable of perfect error detection, we take F t to be a binary ACK7NACK (i.e., F t = for ACK and 
F t = 1 for NACK), so that 



Pr(F t = f I lu R t ) = < 



'e( lt ,Rt), f = l 

(2) 

l-e{ lu R t ), f = 



We assume that the reverse channel is error-free but introduces a delay of singldj packet interval. Thus, the 
"information" available to the transmitter when choosing rate R t is I t = [F±, F2, . . ■ , Ft-i, R\, R2, ■ ■ ■ , Rt-i] 
We find it convenient to explicitly include the previous rates {R T } T<t in the information vector I t because 
the ACK/NACK feedback F T characterizes channel quality relative to the transmission rate R T . Note that 

'it is straightforward to generalize all of our results to a general delay of d ^ 1 packet intervals. While the generalization 
does not alter the fundamental nature of our results, it requires a more complex notation, which we avoid for clarity. 



December 30, 2009 



DRAFT 



5 



the controller chooses the transmission rate at time t solely based on the information vector I t , which 
is available at the receiver as well. We assume that the receiver is also aware of the controller's rate 
allocation strategy, so that it can compute the current and previous values of R t . 

Finally, we assume in the sequel that the SNR is constant over each block of T S> 1 packets, and that 
it changes independently from block to block, i.e., that the channel is "block fading." In the sequel, we 
focus (without loss of generality) on the first block, for which t € {1, . . . , T}, and omit the t-dependence 
on the SNR, writing j t as "7." In addition, we use p(7|I*) to denote the posterior SNR distribution, which 
can be associated with the prior distribution p(-j) through the conditional mass function P(F t | 7, Rt) 
given in ©. Furthermore, we denote the set of possible posterior probability distributions using V(It)- 

B. Ideal Rate Selection 

We define the ideal controller as the one that, at time t, based on the available information I t , jointly 
optimizes the transmission rates (Rt, ■ ■ ■ , Rt) to maximize the sum-rate Y1t=i subject to a constraint 
on expected error probability. In doing so, we allow any packet to be declared a probe packet, which is 
exempt from the expected-error-probability constraint but contributes nothing to sum rate. Probe packets 
are used exclusively to learn about the SNR 7, in the hope of more efficient allocation of future data 
packets. In particular, the ideal controller chooses rates according to the following constrained optimization 
problem: 

T 

max y D T R T (3) 

(D t ,...,D T )e{o,i} T - t + 1 , (R t ,...,i? T )e^ T - t + 1 ^— ' 

subject to D T E P [5(7, R T ) \ I t ] < e~ Q for all r = t, . . . , T. (4) 

Here, D T G {0, 1} indicates whether the r th packet is a data packet (D T = 1) or a probe packet 
(D T = 0), and a > is an application-dependent quality-of-service (QoS) parameter. Note that the 
expectation E p [■] in (0]) is taken over the conditional distribution p(y\I t ). The Lagrangian for the time-t 
constrained optimization problem ©-(HJ) is 

T 

L T (D T , R T , X T I I t ) 

T=t 

for 

L T (D T ,R T ,X T I I t ) = D T R T - X T (D T E p [e( 7 ,R T ) \ I^-e""), (5) 
where X t , . . . , Xt are the Lagrange multipliers. The dual problem can then be stated as 

T 

max y min L T (D T , R T , X T I I T ). (6) 

(D t ,...,D T )e{o,i} T - t+1 , (iJ tv ..,R T )e^ T - t + 1 x T ^o 
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Fig. 2. The controller decomposed into two components: a channel estimator and a rate allocator. 



With ACK/NACK feedback, recall that I t = [F 1 , F 2 , . . . , F t -i,Ri, R 2 , ■ ■ ■ , Rt-i]- Thus, the choice 
of Rt affects not only the contribution to the sum-rate but also the "quality" of the conditional SNR 
distribution p(j | I T ) at times r > t + 1. As these future SNR estimates get worse, the controller is 
forced to choose more conservative (i.e., lower) rates in order to satisfy the expected error-rate constraint. 
(We justify this statement in the sequel.) Thus, the selection of Rt has both short-term and long-term 
consequences, which may be in conflict. Consequently, the solution to the ideal rate adaptation problem 
© under ACK/NACK feedback is a partially observable Markov decision process (POMDP) [17]. For 
practical horizons T, it is computationally impractical to implement this POMDP, as now described. 
Firstly, notice that the state of the channel is continuous. Even if the channel state was discretized (at 
the expense of some loss in performance), the required memory to implement the optimal scheme would 
grow exponentially with the horizon T. Indeed, this POMDP lies in the space of PSPACE-complete 
problems, i.e., it requires both complexity and memory that grow exponentially with the horizon T [18]. 

Next, consider the (genie-aided) case of perfect CSI, i.e., I t = 7 for all t. When the channel is known, 
there is no need for probe packets, and thus the optimal solution chooses D T = 1 Vr. Furthermore, 
since the rate choice does not affect the quality of the SNR estimate, the ideal rate assignment problem 
decouples, so that the best choice for R t becomes 

^perf-CSI^ A arg max ^ s . t . £ ( 7; Rfi <; g-o. (7) 

Indeed, with perfect CSI, constraint (0]) is active for all t = T p + 1, . . . , T, since e{pf,Rt) is a convex 
increasing function of Rt and the objective function is linear in R t . Notice that, in this case, ideal rate 
selection is greedy and i?]? eif ~ CSI (7) is invariant to time t. 

C. Practical Rate Selection 

Based on the observations above, we consider a practical (non-ideal) approach, motivated by techniques 
from the field of adaptive control [19], which deviates from the ideal approach in two principal ways: 

2 This invariance holds as long as e(-, •) is t-invariant, i.e., the coding/modulation scheme does not change with time. 
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1) the probe packet locations are set at the first T p packets in each T-block, and 

2) the controller is split into two components: a channel estimator, which produces an SNR estimate 
7(It p+ i) based on the probe-packet feedback It p +i> and a rate allocator, which assigns the data 
packet rate based on 7 (It + i). (See Fig. 12) 

As before, the rate allocator chooses the data-packet rates +1 , . . . , Rt) in order to maximize sum-rate 
under an expected-error-probability constraint. In particular, at each time t € {T p + 1, . . . ,T}, the rate 
Rt is chosen via: 

T 

max } R T (8) 

(R t ,...,R T )&l T - t + 1 

T=t 

subject to E p [e( 7 , R T ) | 7 (It p +i)] < e~ a , for all r = i, . . . , T, (9) 

where the expectation in © is taken over the posterior distribution p(j | 7(1^ +1)). While related, the 
constraints ((4]) and © have an important difference: the information contained by I t in (0]) is summarized 
by the possibly incomplete statistic 7(1^ +1) in ©. Consequently, satisfaction of © does not necessarily 
guarantee satisfaction of (01) or vice versa. 

Due to the the fact that R t \t>T p +i does not affect the quality of future SNR estimates, the rate 
assignment problem I©-© decouples, and the value of R t satisfying I©-© reduces to 

R* t 4 argmax^ s.t. E p [e( 7 , Rt) \ 7 (It p +i)] < e" a . (10) 

Moreover, (TTOl implies that R£ is invariant to time t. Note that the decoupling that occurs here is 
reminiscent of the decoupling that occurred with ideal rate selection ©-(HJ) under perfect channel state 
information, i.e., (|7]). 

In the next section, we shall see that the choice of estimator plays a key role in the overall performance 
of the practical rate adaptation scheme. Recall that the estimator determines p{^ \ 7(1^+1)), which 
determines the expected error probability constraint. Under certain scenarios, we shall see that a solution 
to (fTOl does not exist, i.e., that no rates within 1Z satisfy the expected error probability constraint. 

III. Rate Adaptation with Imperfect CSI 

Before studying the practical rate allocator (fTOl) . we first consider a particular "naive" data-rate allocator, 
in order to draw intuition on how estimation errors affect system performance. Given SNR estimate 7, 
generated from a particular unbiased estimator, the naive allocator assigns the data rate 

Rf ve (j) = arg max Rt s.t. e{j,Rt)^e~ a (11) 
RtZzR. 
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for all t = T p + 1, . . . , T. Due to the lack of expectation in the error-probability constraint of (fTTb . 
the naive rates may violate the desired expected-error-probability constraint in (fTOl) . This follows from 
the fact that, when the posterior distribution p(7t|7t) is non-atomic (i.e., ct^ > 0), Jensen's inequalitjl^ 
implies that 

Ep [efr.lZt) | 7] >e(j,R t ) Vi2 t . (12) 

Therefore, to ensure the expected-error-probability constraint in ( fTOl) , the practical allocator must "back- 
off" the rate relative to i?" aive (7). To do so, it chooses R£ (7) ^ i?" aive (7), where equality occurs if and 
only if the estimation error N = 7 — 7 is zero-valued (with probability one). 

When the estimator is perfect (i.e., 7 = 7), we note that the naive rate coincides with the ideal rate 
under perfect CSI (i.e., i?" aive (7) = i2]? eif ~ CSI (7)| 7= ^). In this case, i?" aive acts as an upper bound on the 
ideal R t under ACK/NACK feedback, as specified by ©-(HJ). Accordingly, we make the following two 
definitions. 

Definition 1: The rate penalty associated with estimator 7 is the smallest 5 (in bits/symbol) that satisfies 

E p [e( 7 ,R n r e (l)-S) I 7] ^e~ a . (13) 
Definition 2: The power penalty associated with estimator 7 is the smallest scale factor p that satisfies 

E p [e( M7 ,i2f Ve (7)) I 7] <e" a . (14) 

Next, we analyze two different scenarios for the described rate adaptation system. In the first scenario, 
the n symbols in the packet are assumed to be uncoded QAM symbols, while in the second scenario, 
the n symbols are a Gaussian random coded ensemble. Within the second scenario, we focus on the 
high-SNR and low-SNR cases separately. For both scenarios, we use the analysis presented next, in 
Sec. UFA] 

A. Gaussian Approximation of the Estimation Error 

Under the posterior distribution p(7|7), let the estimation error N = 7 — 7 have the distribution 
q(N\ A /) = p(N + 7(7)- Let gN\~f( r ) an d A^^(r) denote the moment generating function and the semi- 
invariant log moment generating function of N given 7, respectively. We assume that there exists some 

3 For unbiased 7, l !12t immediately follows from Jensen's inequality. For biased 7, d 1 2b still holds but requires some effort 
to derive. We skip these details since our focus is on unbiased 7. 



December 30, 2009 



DRAFT 



9 



r max > such that Ajv^fV) < oo for all \r\ < r max - It is well known [20] that A A r|^(0) = 0, A' N ^(0) = 
Eg [N\% and A^|-(0) = a 2 N ^. Then, for any \r\ < r max , 

Eg [exp(riV) | 7] = g N ^(r) = exp (A N ^(r)) (15) 

= exp (e 9 [N I 7] r + ^A^- (r')r 2 ) (16) 

for some r' between and r (having the same sign as r), where ([Tol follows from Taylor's theorem. 
From Jensen's inequality, we know that 

Eg [exp(riV) j 7] ^ exp (Eg [N | 7] r) . (17) 

From Eq. (fT6l ). one can deduce that the ratio of the two sides of (flTT ) is exp (^A^|^(r')r 2 ), which is 
strictly greater than 1, since the log moment generating function is convex [20]. Furthermore, applying 
Taylor's theorem to the third-order expansion, we get 

g m (r) = exp (e, [N | 7] r + \a 2 N ^ + \h"^(r")r^ (18) 

for some r" between and r. 

In many cases, the first two terms of the expansion ( fT8l ) not only closely capture the variability of the 
estimation error N, but also lead to insightful expressions to illustrate the impact of the first- and second- 
order statistics of "channel variability." This will be referred to as the Gaussian approximation, since, 
when N\j is Gaussian, the cumulants of higher order than the variance vanish. Note also that, whenever 
the distribution of N\^f is symmetric around its mean, its skewness is zero and thus A'^(0) = 0. The 
Gaussian approximation works well in this case too, since the effects of the higher-order terms in the 
Taylor series expansion often diminish quickly (e.g., for many distributions, the cumulants decay as the 
order of the cumulants increase). 

Recall that we focus on unbiased estimators, for which Eg [N | 7] = 0. In this case, the Gaussian 
approximation yields the simple second-order approximation 

A^(r) « ^V 2 " (19) 
The approximation (fT9l) is asymptotically accurate for the estimator proposed in Section [V] which is 
asymptotically unbiased and asymptotically normal, as will be seen. 

B. Rate Adaptation with Uncoded QAM 

Here, we study the scenario in which the n symbols {X ti k}k=i °f packet t are uncoded and selected 
from a QAM constellation of size M t . Since the constellation size is constant over the packet, the rate 
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tighfl; 



equals R t = log 2 M t bits/symbol. The following is a tighu approximation [1, p. 289] on the symbol error 
rate associated with minimum-distance decision making [21, p. 280]: 

3 7 



£k(l,Rt) ~ 0.2 exp 



2 2^* 



1 



The associated packet error rate is 



s( 1 ,R t ) = l-(l-s k ( 1 ,R t )) n , 

since £^(7, Rt) remains constant for all k, as 7 and Rt remain constant over the packet. 
Since we can write 

(l- £fc ( 7)J R t )) n < l-ne k (rf,Rt) + ^n(n-l)el(rf,Rt), 



(20) 



(21) 



(22) 



it follows that e(j,Rt) > §£fc(7, Rt) for all (7, -Rt) such that ek(j,Rt) < ;^rr- Similarly, (1211 implies 
that 2(7, R t ) < 1 — (1 — -^) n f° r the same (7, R t ). This latter bound is an increasing function of n, and, 
for n 3> 1, it approximately equals 1 — e~ 1 , which is much higher than typical error rates. We assume 
that n is large enough and the possible outcomes of (7, Rt) are such that e{^,Rt) > §£fc(7, Rt) for all 
t with probability close to 1. We further elaborate on this assumption after deriving the condition next. 
To meet the expected-error-probability constraint ( fTOl , it is necessary that 



^E p [e k (-f,R t ) I 7] » 



0.2 exp 
0.2 exp 



1 



2 2 R < - 1 

3 7 + iV 



7 



7 



^ e~ a . 



(23) 



Using the unbiased Gaussian approximation 1191 ), condition 
the natural log of both sides: 



2 2^' - 1 

can be rewritten as follows, after taking 



7 



a 



+ 



JV|7 



1 



< —a — InO.ln. 



2 2^*-l 2 V 22 ^- 1 , 
Condition (1241 implies that, for the existence of a feasible rate R t , it is necessary that 



7 



5s 2(a + InO.ln). 



(24) 



(25) 



'jv|7 



Condition (1251 implies that T^/c^i^ the effective SNR of estimator 7, must be at least 2(a + InO.ln 



to guarantee an expected error rate of e a . Using similar stepsjf] a sufficient condition 7 2 /c^|^ ^ 



4 The bound holds within approximately 1 dB from the true value for a wide range of SNRs [1, p. 289]. 

5 From ( 12 1 b and the fact that (1 — £t,k) n > 1 — net,k, we have e(-y,Rt) ^ neki'y, Rt) for all (t, k) with probability 1. 
Consequently, for satisfaction of d lOt . it is sufficient that nE p [ekiy, Rt) | 7] ^ e~ Q . Replicating d23t-(l25l>. we obtain the 
sufficiency condition. 
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2(a + ln0.2n) can also be derived, illustrating the tightness of (l25l) . We will investigate the difficulty of 
achieving this condition in the next section. 

Given that d25l ) is satisfied, one can solve (1241 ) to find the upper bound < ^(7, ct^-,^), where 



N\*t) 



log 2 



1 + 7 



V 



3 °JV| 7 

2~ 



1 2 



1 - \ 1 - 2(a + ln0.1n)- 



(7 



AT| 7 



7" 



(26) 



Fig. 3(a)| plots the upper bound (l26l ) as a function of the estimator's effective SNR 7 2 /°1v|7 for 7 € 
{13, 20, 25} dB, a desired packet error rate of e~ Q = 10~ 3 , and a packet size of n = 500 symbols. The 
naive rate allocation 

: (7) = log 2 l 1+7.: — — J (27) 

T 2 



^>naive 



2a + ln0.1n/ 

(derived from d23l) with iV = 0) is also shown on the same plot. The required effective SNR 7 2 /°jvh 
as imposed by (|25]>, is 21.6 here. Fig. [3(a)] shows that Rf^tf) < 2 bits/symbol for 7 ^ 13 dB. Since 2 
bits/symbol is the minimum possible rate for uncoded QAM, we conclude that it is impossible to meet 
the target packet-error rate of 10 -3 when 7 ^ 13 dB, even with perfect CSI. 




f> 2 



N|7 



Fig. 3. 
bound [i 



For QAM signaling, (a) rates R\ and Rf" ve versus estimator's effective SNR j 2 and (b) power penalty lower 
versus estimator's effective SNR 7 2 / <t jv|7 > 



By definition, the rate penalty is the smallest 5 that satisfies 5 = i?" aive (7) — ^(7, <y^L.). Thus, an 
upper bound on 5 is given by 
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From Fig. 3(a) we can see that 5 (7, c 2 N ^) depends on the effective SNR 7 2 /°1v|7 : it is significant when 



the effective SNR is near the minimum value established by (|25T ). but shrinks as I 2 /o' 2 N ^ l gets large. In 
addition, S(ff, grows in proportion to 7. 

By definition, the power penalty is the smallest n that satisfies -Rf (7) = -R" aive (7//Li). Thus, a lower 
bound /u(7, fT^-i-) on the power penalty can be found by solving R^(j, = -R" aiV£ : (7 7 7-0 f° r A 4 - The 



power penalty lower bound //(-y, cr^-,-,) is plotted in Fig. |3(b) as a function of effective SNR I 2 /o~ 2 N \~ for 
the same expected packet-error rate, 10~ 3 , and packet size, n = 500, as in Fig. 3(a) The power penalty 



is seen to be as high as 3 dB when the effective SNR is near the minimum value established by (1251 ). 
but shrinks as ^V^jvl-y gets large. 



C. Rate Adaptation with Random Gaussian Ensembles 

Next, we study the random coding [22], [23] scenario in which the codewords are selected from a 
Gaussian ensemble. Let -R max be the maximum rate in 1Z. Then the Gaussian ensemble consists of 
2"" Rmax possible packets, where each symbol, X t ^, of packet t is chosen independently from a jV(0, 1) 
distribution]^ (We use unit variance here because earlier we assumed E [JJQ^I 2 ] = 1.) At time t, say that 
transmission rate Rt € 1Z is chosen. Then one packet from a size-2 nflt subset of the initially generated 
set of 2 niJmax packets is chosen arbitrarily for transmission. 

The receiver is assumed to know the subsets of possible packets corresponding to each admissible 
rate Rt £ TZ. Based on its observation of the t th packet, the receiver finds the most likely packet within 
the subset of 2 nRt possible packets. Note that, unlike the uncoded QAM scenario, where each symbol 
is decoded separately, here the entire packet is decoded as a unit. An upper bound for the associated 
decoding error probability is (e.g., [22]) 



e(7, R t ) < exp np 



R t In 2 - - In f 1 + 

2 V 1 + P 



(29) 



where p 6 [0, 1] is the union bound parameter. One can minimize (1291 over p £ [0, 1] to find the tightest 
bound, if so desired. To satisfy the expected-error-probability constraint ([T0l . it suffices that there exists 
a p 6 [0, 1] for which 



exp np 



R t ln2 - - In (1 + 

2 \ l + p 



7 



^ e 



(30) 



We use real-valued symbols, instead of complex-valued symbols, for simplicity. Consequently, the data rates will be 
represented in units of bits per real-symbol. For fair comparison with uncoded QAM, one should simply double these data 
rates. 
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1) Low-SNR Regime: When Pr(7 <C 1 | 7) ~ 1, we can write 

lnfl + ^-U^- = i±^. (31) 

For an unbiased estimator, E g [^^|7] = and var^^^]^) = ^"^ 2 . Thus, using the Gaussian 

approximation ( fl9l ), the constraint (1301 is satisfied if there exists a p G [0, 1] for which 

7 1 np 2 

2(1+7) + 8 (1 + p) r 



a ^-np[R t In 2 - _ 7 _^ + _ 7 _r_ ^ | . ) , (32) 



or, equivalently, for which 

K '^ln2^ np + 2(l + p) 8 (1 + p)^^) ' ^ 
Thus, if there exists some p G [0, 1] for which the right side of (l33l) is positive, then any R t below it is 
feasible. For this to be possible, we need 

2a(l + pf - 7 np(l + p) + J(np) 2 ^,- < 



for some p € [0, 1], which leads to the following necessary conditional for the estimator: 

^ 2a. (34) 



One can then find an upper bound on R t € 1Z satisfying (1301 ) as follows: 

n*t- 2 s 1 ( a 7 1 np 

ii+ (7, ct ru , ) = max ; r r^f^u, ■ (35) 

' ^ P £[o,i]ln2V np 2(1 + p) 8(l + p) 2 N \i' 



Likewise, one can deduce from (I29l > and (1311 ) that the naive rate is 



Rf ve (j) = max -J- ( - — + — ^— ) . (36) 
' w; P e[o,i]ln2 V np 2(1 + p) J 



The rate upper bound i?j is plotted in Fig. |4(a) as a function of the estimator's effective SNR I 2 /& 2 N \~ 



for 7 G {—3, —8, —12} dB, a desired packet error rate of e~ a = 10~ 3 , and a packet size of n = 500 
symbols. The rate i?" aive from d36l ) is also shown on the same plot. Every point on the rate curves was 
computed using the optimal value of p G [0, 1], found numerically. We note that, with these parameters, 
(l34b implies that 7 2 /°iv|7 must be at least 13.8. Figure [4(a)] also shows that the rate penalty 5(7, cr^<~) = 

i?" aive (7) — ^£(7, er]L^) is significant when 7 2 /°ivu is near the lower bound established by d34l ), but 

2 

AT|7 



that the rate penalty shrinks as 7 2 /c 2 vr i^ increases. 



7 Note that condition i34i is not exactly analogous to condition {25k Condition ( 1341 ) is necessary for a non-empty solution 
set to exist for inequality J331 >, whereas ( 1251 ) is necessary for the existence of a feasible rate that satisfies the expected-error 
bound. In order to derive an analogous necessary condition, one can use a sphere-packing (SP) bound for the Gaussian channel 
(see, e.g., [24]). With the SP lower bound, our findings would be qualitatively similar, but the derivation would be extremely 
tedious. For this reason, we assume that the upper bound is a good approximation for the actual error rate. 
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Fig. 4. For Gaussian signaling at low SNR, rates R1 and _R" alve versus (a) estimator's effective SNR -y 2 /0jL- and (b) estimated 
SNR 7. 



For the same target packet error rate (10 -3 ) and packet size (n = 500), Fig. |4(b)| plots versus 7 
for estimator effective SNR I 2 /<? 2 N ^ g {^0, 100}- I n tne same figure, i?" aive and the "naive" Shannon 
limit (i.e., ergodic capacity) ilog 2 (l +7) bits/real-symbol are shown. By comparing the naive Shannon 
limit with i?" aive , one can observe that, in the low-SNR regime, the power penalty of Gaussian signaling 
scheme can be significant, especially at small values of 7. From the same plot, one can observe that the 
additional power penalty due to imperfect SNR estimation, p{^), is quite small: less than 0.5 dB when 
7 2 /cr^^ = 100 and less than 1 dB when 7 2 /crL- = 60. 

2) High-SNR Regime: When Pr(7 S> 1 | 7) ~ 1, we can write 

Thus, for an unbiased estimator, Eg [in x+Ht] ~ m j+p anc ^ var< ?0 n T+pl^) ~ ~^ L - Similar to the low 
SNR scenario, we can use the Gaussian approximation (fl9l ) to claim that (l30l ) is satisfied if there exists 
a p 6 [0, 1] for which 

1 / 7 \ 1 np 



or, equivalently, 



a> -n^Ran2--Jn [ — )+ - w ^-j, (38) 



If a 1 / 7 \ 1 np 



Rt < ; h - In —— tt^5— • (39) 

In 2 1^ np 2 \l + pj 
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Hence, if there exists some p £ [0, 1] for which the right side of (l39l ) is positive, then any R t below it 
is feasible. In the high-SNR regime, we have 7 3> 1 with high probability, and thus there almost always 
exists some p G [0, 1] for which a feasible R t > exists. One can deduce from this observation that, a 
principal difference between the high-SNR and low-SNR regimes is that, in the high-SNR regime, the 
expected error probability constraint is satisfied much more easily, with nearly any SNR estimator. One 
can then find an upper bound on R t € TZ satisfying (l30b as follows: 



i?*( 7 , a 2 ) = max J_ f + I ln ( _±_\ _ I . (40) 
Likewise, one can deduce from d29l ) and (I37T ) that the naive rate is 

Maive/-) = J_ ( _^ + I ln ( _J_ ) ) . (41 ) 

* K " p6 [0,i]ln2V np 2 \l + p ) J 




Fig. 5. For Gaussian signaling at high SNR, rates R1 and i?""™ versus (a) estimator's effective SNR j 2 /<t' 2 n ^ and (b) estimated 
SNR 7. 



The rate upper bound (7, & 2 N ^) is plotted in Fig. |5(a)| as a function of the estimator's effective SNR 



7 2 /<t^|^, for 7 € {13, 20, 25} dB, a desired packet error rate of e a = 10 3 , and a packet size of n 



•' . — 500 

symbols. The rate /2" aive (7) from (|4TT > is also shown on the same plot. Every point on the rate curves was 
computed using the optimal value of p £ [0, 1], found numerically. We emphasize that the rates plotted 
in Fig. |5(a)| are expressed in bits per real-symbol, and thus should be doubled for fair comparison with 



the QAM rates presented in Fig. |3(a)| For Gaussian signaling, if we compare the high-SNR results in 
Figs. |5(a)|5(b)| to the low-SNR results in Figs. |4(a)H4(b)[ we can see that the normalized rate penalty 
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5/Rf is much smaller in the high-SNR regime. For instance, at 7 2 /°iv|7 = 20, 6 is no more than 0.5 
bits/symbol and 5/R% is less than 25% for all three values of 7. This decrease in rate penalty is expected, 
since, in the high-SNR regime, the rate scales roughly with the log of the SNR. 

For the same target packet error rate (10~ 3 ) and packet size (n = 500), Fig. |5(b)| plots R£ (7, o~ 2 N *~) 
versus 7 for estimator effective SNR 7 2 /^|^, 6 {60, 100}. In the same figure, i?" aive (7) and the naive 
Shannon limit ^ log 2 (l+7) are shown. There we observe that, in the high-SNR regime, the power penalty 
for Gaussian signaling is constant with 7, and no more than 1.5 dB. The additional power penalty due 
to imperfect SNR estimation, ^(7, <r^,~), is approximately 1 dB when 7 2 /c 2 v ^ = 60 and approximately 
2.5 dB when 7 2 /°jvw = 20. 

IV. Fundamental Limitations of ACK/NACK-Based Rate Adaptation 

In the previous section, we studied the performance of the rate adaptation system for a generic unbiased 
estimator. We analyzed the feasible rates with particular coding/modulation schemes as a function of the 
"quality" of the estimation provided by the estimator, for which the relevant metric was the estimator's 
effective SNR -y 2 /cjL-,. Note that we assumed no knowledge of the prior SNR distribution p{^). 

In this section, we view the SNR of the current block, 7, as an unknown parameter^ and pose the 
estimation of 7 as a non-Bayesian parameter estimation problem. We first investigate the fundamental 
limitations of SNR estimators that are based on packet-level ACK7NACK feedback, e.g., 7 = 7(It p+ i). 
Using that analysis, we show that it is difficult to make good SNR estimates while simultaneously keeping 
packet-error-rate low. This latter property motivates SNR-estimation via probe packets that come without 
error-rate constraints (in contrast to data packets, which are error-rate constrained) as assumed in Sec. ITT] 
Finally, we discuss optimization of the probing period T p , and we derive an upper bound on the optimal 
sum rate -R| um . 

A. Fundamental Limitations of ACK/NACK-Based SNR Estimation 

Consider the SNR estimator j(I Tp+ x), based on the T p ACKVNACKs in 

I Tp +i = [F U F 2 ,..., F Tp , R u R 2 , ...,R Tp ], 

where R t denotes the rate and F t denotes the ACK7NACK feedback for packet t. In the sequel, we 
abbreviate 7(1^ +1) by 7. Recall that R t and F t are connected through the packet error probability 
s(j,Rt), as specified in (0. 

8 We assume that 7 is a random variable, taking on an independent value for each block, but that the distribution of 7 is 
unknown to the transmitter. 



December 30, 2009 



DRAFT 



17 



Theorem 1: For true SNR 7 and any unbiased estimator 7 based on T p ACK/NACKs, the estimation 
error variance, cr 2 N ^ = var (7 — 7(7), is lower bounded by: 




e( % R t )[l-e( % R t )} 



Then 



l-e(j,R t ) \e(j,R t ) 
The Fisher information [25] associated with F t is: 

<5>( 1 ,R t ) = ™v(V( 1 ,R u f t ) I j,R t ) 

e(i,Rt) [l-e( r/ ,R t )Y 
:1 &(rf,Rt). Theoremfflfc 
bound (CRLB) for unbiased estimators is the reciprocal of the Fisher information [25]. 



(42) 



where e(j,R t ) is continuously differentiable in 7 and £'(7, Rt) = ^£(7, i?t). 
Proof: Given 7 and the rates R\ , . . . , Rt p , the feedback F± , . . . , Ft p satisfies 

% 

Pr(Fi = A, . . . , F Tp = f Tp I 7, i? l5 . . . , R Tp ) = J] Pr(F* = / t | 7, i^) . (43) 



F( 7 , i?t, / t ) = ^ lnPr(F t = f t | 7, i? t ) 

= ^ln([e( 7 ,^)] / 11-e(7,^)] 1 " /t ) 

£ '(7,^) f /* x \ (44) 



(45) 

and the cumulative Fisher information is Y2t=i ^("fi R-t)- Theorem [T] follows since the Cramer-Rao lower 



B. Lower Bounds on the Required Probing Period T p 

In Sec. [nil we derived lower bounds d25l) and (l34l on the value of 7 2 /o~ 2 N ^ (i.e., the estimator's effective 
SNR) required to facilitate the use of data transmission via uncoded QAM signaling and randomly coded 
Gaussian signaling, respectively. In this section, we translate those lower bounds (on required ^ 2 /o~ 2 N \~) 
into lower bounds on required probe-duration T p , recognizing that the quality of SNR estimates (and thus 
I 2 /o~ 2 N ^) increases with T p . From these bounds, we shall see that the required value of T p depends heavily 
on the probe error rate, and in particular that the required value of T p grows very large as the probe error 
rate grows small. This motivates the optimization of probe error rate, which requires the decoupling of 
probe error rate from data error rate (since the latter is usually constrained by the application). 
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In this section, we assume that both the modulation/coding scheme and the rate is fixed over the probe 
interval, i.e., that Rt = Rp for t G {1, . . . , T p }. In this case, the CRLB (|42l reduces to 



2 1 e( 1 ,R p )[l-e( 1 ,R p ) 



(46) 



(e'fr.-Rp))' 
which is inversely proportional to T p . 

Recall that, to make uncoded QAM signaling feasible, condition (l25T ) must be satisfied, and to make 

random Gaussian signaling feasible in the low-SNR regime, condition (l34l must be satisfied. Though 



251 ) and (1341 ) are expressed in terms of the estimator's effective SNR, we can rewrite them as cr N ^ ^ 



^7 2 /(o + ln0.1n) and Cjv| 7 ^ 57 2 / Q > respectively, and apply the CRLB (l46l ) to arrive (see Appendix lAl) 
at the following. For uncoded QAM, we need T p ^ T p mm , where 

2(q + ln0.1n) e(j,R p ) 



rpmm 
1 P 



(47) 



(1 - e( 7 , Rp)) [(1 - e( 7 , i^p))" 1 /" - l] 2 In 2 (5(l - (1 - e( 7 , Rp))^)) (n 7 / 7 ) 2 ' 
and for random Gaussian signaling in the low-SNR regime, we need T p ^ T p min , where 

T min = 8a(l-e( 1 ,R p ))(l + p*+ 1 ) 2 
p e(7,i? p )(np*7) 2 
and where p* is the union bound parameter corresponding to the tightest error bound d29l ), which itself 

depends on 7, i? p , and n. 



10 
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e(7.-Rp) 



Fig. 6. Lower bound on required probing duration T^ 111 versus probe packet-error rate e(7,7? p ) for (a) uncoded QAM and 
(b) random Gaussian signaling. 

Figures (6fa)-(b) plot T p mm as a function of the probe error rate £(7, R p ) for uncoded QAM signaling 
and random Gaussian signaling, respectively. For the plots, we assume 7 « 7, which eliminates the 
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dependence of T™ in on 7 and 7 in the QAM case; for the Gaussian case, we show T p for the values 
7 € {—3,-7,-10} dB. As in our previous plots, we assumed n = 500 and e~ a = 10~ 3 . The key 
observation to make from these plots is that the number of probe packets increases quickly as £(7, R p ) 
shrinks. In fact, the plots suggest that T p is roughly proportional to l/e(j,Rp). This inverse relationship 
is somewhat intuitive because, given a probe packet-error rate of 5(7, R p ), one must wait for l/e(7, R p ) 
packets (on average) to see a single NACK. Recall, however, that Fig. [6] shows only a lower bound T p min 
on the probe duration required for communication with positive rate; the optimal value of T p is expected 
to be even larger. 

The main conclusion to draw from this section is that, to keep the probing period small, one must 
allow relatively high probe error rate £(7, R p ). For systems which estimate SNR using only ACK7NACK 
feedback from data packets, this implies that if the data error rate e~ a is small, then the number of 
packets required to get a decent SNR estimate will be large. Such systems would only be suitable for 
channels that are very slowly fading. 

C. An Upper Bound on the Optimal Sum-Rate 

p 

Recall that, in our practical rate adaptation system, the data packet rates {Rt} t =T p +i we chosen based 
on the SNR estimated using ACKVNACKs from probe packets with rates {Rt)tL\- To complete the 
system design, we must choose the rates {R t }J =l as well as the probe duration T p . In doing so, we 
aim to maximize the sum data rate i? S um = Ylt=T p +i &t while satisfying the expected error-probability 
constraint in ( fTOl . Intuitively, we know that increasing T p improves the SNR estimate which, in turn, 
allows a higher data rate (since less rate "back-off" is needed to satisfy the error constraint). On the 
other hand, for a fixed block length T, the number of data packets, T — T p , shrinks as T p increases. 
Therefore, the choice of T p involves a tradeoff between these two objectives. In this section, we discuss 
the choice of {T p , . . . , Rt} and derive an upper bound on the sum rate -R S um that leverages the rate 
bounds from Sec. |III] and the CRLB from Sec. ITV-Al 

In Sec. III-CI we recognized that the data-rate assignment problem decouples in such a way that the 
optimal data rates {Rl}f =T +1 become independent of time t. Thus, in the sequel, we focus on choosing 
a single data rate R d , whose optimal value will be denoted by R^. The system design problem then 
reduces to the following sum-rate maximization: 

R* sum 4 ma x (T — T p )R d s.t. E p [e( 7 , R d ) \ j(I Tp )] ^ e~ a . (49) 

T P <T, (R 1 ,...,R Tp ,R d )en T f + 1 

As argued in Sec. [Till the optimal data rate R d increases monotonically with the quality of the SNR 
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estimate, i.e., with the inverse of the estimator variance Thus, the optimal probe parameters 

{Tp,R\, . . . ,Rt p } are those that minimize cjLa. From the CRLB in Theorem Q] we know that ctL- ^ 



|-. num uis ^vx.,^ 111 iu^ihiiuj jviiuw uim ^|. 
.^|^(7). Where 

2 / \ A - ^ [£ / (7,^)] 2 ^ 1 ,- m 

£ ^ (7) = (Jtp .X)^ [g e ( 7> *)[l- e ( 7> i*)] ) (50) 

Vmax , Y\ (51) 

Rt&Tl ■, Rt) [1 ^(T) ^t)] J 
Thus, if 7 was provided by a genie, and if the SNR estimator was efficient (i.e., CRLB achieving), then 
d5TT > suggests to set the probe rate at 

Rf nie ( 7 ) = arg max - (52) 



which is invariant to both time t and probe duration T p . This yields 

2 , , _ 1 g (7^r nie (7))[i- g (7^r nie (7))] 

Q-NfiW) - 7p — ^genie, ' ^ 

Jp [e'(l,Rp ( 7 ))] 2 

Using the genie-aided probe rate i?p 6r " e (7), we can upper bound the optimal sum rate d49l ) by 

< e~ a , (54) 



«' = T max (T-Tp)^ s.t. E p e ( 7) R d ) \ 7 «" fr). T p ) 

where we explicitly denote the dependence of the estimate 7 on both T p and i?p enie ( 7 ). 

Next, recall that we established, in Sec. JIIJ upper bounds on the largest data rate that satisfies an 
expected error constraint of the type in d54l . In particular, d26l ) gave an upper bound for uncoded QAM 
signaling, and (|35T ) and (l40l ) gave upper bounds for Gaussian signaling in the low-SNR and high-SNR 
regimes, respectively. These data-rate upper bounds, ^(7, cr^L..), can be applied to (l54l to bound the 
optimal sum rate as i?| um ^ ^!um> where 

^sum - ™ (T - T p )R* d ^, a 2 N{ ~), (55) 

and where 7 and ^ 2 N ^ are dependent on both T p and -R p ( 7 ). Since 7^(7; ^jv^) increases monotonically 
in l/a 2 N ^, we can upper bound R% using the lower bound on o 2 N ^ established in (I53T ). This yields 

T>* pmax f nr 

- rt sum -^sum lul 

« = max (T - ^(7,^(7)). (56) 

Figures 7(a) and 7(b)| plot the normalized sum-rate bound ifR^fi as a as a function of the estimated 
SNR 7 for uncoded QAM and Gaussian ensembles, respectively, at T = 5 and T = 50. As before, we 



December 30, 2009 



DRAFT 



21 





7(dB) 



7(dB) 



Fig. 7. Normalized sum-rate bound -^R^^ as a function of SNR 7 for (a) QAM and (b) Gaussian signaling in the low-SNR 
regime. 

use target error rate 1CT 3 and packet size n = 500. For the genie-aided probe rate -Rp eme (7) used to 
calculate £^^(7), we assumed that 7 7. The figures also show R™ 1V£ and the naive Shannon limit 
^ log 2 (l + 7), for comparison. Note that the difference between the naive rate i?Jj aive and the upper bound 
T^sum increases significantly as T decreases. This is due to the fact that, as T decreases, it is too 
costly to allocate a long probing interval, implying that the quality of SNR estimates decreases, so that 
more rate back-off is required. Note also that the difference between the naive rate and the upper bound 
increases as the SNR increases. This implies that the lack of perfect CSI becomes more costly as the 
SNR increases. 



V. An Asymptotically Optimal SNR Estimator 

The quality of SNR estimates based on ACK/NACKs from a probe interval is strongly dependent 
on both the probe rates {i? i } t ^ 1 and the probe interval T p . For the sum-rate upper bound derived in 
Sec. IIV-CI the probe rate -Rp en ' e (7) in d52l ) was selected in a genie-aided manner, assuming knowledge 
of the true SNR 7. Clearly, 7 is not known in practice. 

In this section, we develop a practical SNR estimator that, during the probing interval t € {1, . . . , T p }, 
recursively updates the probe rate R t and % (i-6-> the time-t estimatdj of 7) using the latest feedback pair 



We emphasize that 7t is the time-i estimate of the time-invariant SNR 7, and should not be confused with the time-varying 
SNR 7 t that was briefly used in Sec. Ull before the time-invariance assumption was introduced. 
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{Ft-x, Rt-i}- We show that the probe rate adaptation is asymptotically optimal, in that R t converges 
to < me ( 7 ) for any initial probe rate R\. Moreover, we show that our SNR estimator is asymptotically 
efficient and asymptotically normal, i.e., that the corresponding estimation error N t = % — 7 converges 
to a zero-mean Gaussian random variable whose variance is identical to the CRLB achieved with the 
genie-aided probe rate i?p enie (7). The normality of the error helps to justify the Gaussian approximation 
used to derive the rate bounds (|47T ) and (|48T ) for the uncoded QAM and Gaussian cases, respectively. 
The SNR Estimator: 

1) At time t = 1, choose an arbitrary rate Ri € 1Z and an arbitrary estimate 71. 

2) At each time t = 2, . . . , T p , update the estimate as 

F t -! -e(j t -i,Rt-i) 
It = It-i + TT — 77-771 75 — 77 P 7 ) 



Rt = argmax$(7t,i?), (58) 



and choose the rate i?+ as: 



where $(•,•) is the Fisher information as defined in (1451) . 
We prove the following for our estimator. 

Theorem 2: For both uncoded QAM and Gaussian ensembles, as T p — > 00, 

d 



T p ( 7Tp — 7) — > Nt p ~ ( 0, $- 1 ( 7 , i? 9 p eme (7)) • (59) 



Proof: See Appendix |Bj ■ 
Theorem |2] implies that our estimator (1571) is asymptotically efficient and consistent. Moreover, without 
any prior information on 7, rate allocation (T58T ) guarantees the performance achieved with the genie-aided 
probe rate R% e me (7). Next, we simulate the estimator. Instead of the t — 1 on the denominator, we use 
(t — l) 13 for various values of (3 6 (0, 1]. 

In Fig. [U a single realization of the estimator and the corresponding assigned rate are illustrated for 
different values of 7, over a block of T p = 500 probe packets of size n = 500 symbols. The value 
of 7 and the asymptotic rate Rp er " e (j) are also shown on the associated graphs. The initial points for 
the estimator are 71 = 3 dB, R\ = 1 bit/symbol, and the set of possible rates are 1Z = {1,2,..., 10} 
in bits/complex-symbol, i.e., the possible constellation sizes are integer powers of 2. For = 0.5, one 
can observe that the optimal rate is reached with approximately 20 probe packets for all values of SNR. 
Once that point is reached, the estimation error variance decays fairly slowly due to the low decay rate 
f3 = 0.5. With a higher 0, it takes longer to approach the vicinity of 7, from the initial value 71, but the 
estimation error variance is lower once in steady state. This observation is illustrated in Fig. |8(c)| where 
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Fig. 8. Examples trajectories of the recursive SNR estimator when uncoded QAM is used. 

(3 = 1 and the probing block size is T p = 2000 packets. In the realization corresponding to 7 = 20 
dB, the "steady state" is yet to be reached after 2000 packets. On the other hand, the amplitude of the 
fluctuations around the final point decay much faster, as one can observe in the realization corresponding 
to 7 = 10 dB. Different choices for j3 and the associated tradeoffs involved in stochastic approximation 
algorithms are studied in [26]. 
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Fig. 9. Examples trajectories of the recursive SNR estimator when Gaussian signaling is used. 



We illustrate our estimator response for Gaussian ensembles in Fig. |9l As the set of rates 1Z, we picked 
100 points, equally spaced between and 5 bits/real-symbol. The initial SNR estimate, 71 = dB, was 
much smaller than the initial one in the QAM simulations, but the initial rate, i?i = 0.5 bits/complex- 
symbol, was identical to the one in the QAM simulations. Here, we analyze SNR realizations 7 = 3, 10 
and 20 dB. With Gaussian ensembles, the convergence speed is slightly lower than that with QAM. While 
the convergence is almost immediate for 7 = 3 dB, it takes 30-40 packets for 10 dB and 130-140 packets 
for 7 = 20 dB. This difference is mainly due to the difference in the distances between the initial and 
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final points. On the other hand, due to the large size of the set of possible rates (unlike QAM, where 
only a few discrete points are possible), there exists some Rt G 1Z that is very close to the genie-aided 
probe rate i?p enie (7). Consequently, the estimation error variance decays much faster once R t comes near 
the vicinity of i?p enie (7). We also illustrate the estimator with (3 = 1 in Fig. 9(c) and one can notice the 
slow convergence, similar to the QAM simulations. 

VI. Conclusion 

In this paper, we studied rate adaptation based on ACK/NACK feedback. In particular, we studied 
methods that maximize data rate subject to a constraint on expected packet-error probability, assuming 
that the transmitter has no knowledge of the SNR distribution. Because optimal rate allocation was 
identified as a POMDP, which is impractical to implement, we focused on a suboptimal framework 
where a channel estimate is calculated based on previous feedback and a rate is chosen based on this 
channel estimate. To aid the initial rate allocation, we allowed the use of T p probe packets at the start 
of each data block. First we considered a so-called "naive" rate allocator that maximizes rate subject to 
a constraint on instantaneous packet-error probability, calculated from a given unbiased estimate 7 of 
the true SNR 7. Due to the inevitable error in SNR estimation, we argued that one must either back- 
off the naive rate, or correspondingly increase the SNR, to meet the stricter expected error probability 
constraint. Based on a Gaussian approximation of the estimation error N = 7 — 7, we derived conditions 
on the "effective estimator SNR" 7 2 /cr 2 N ^ that are necessary for the existence of a feasible transmission 
rate, as well as an upper bound on the transmission rate when this necessary condition is satisfied. 
This latter analysis was carried out for both uncoded QAM signaling and random Gaussian signaling 
(the latter in both the low-SNR and high-SNR regimes). Next, we considered unbiased SNR estimation 
via ACK/NACK feedback. First, we lower bounded the error variance of those estimates (for general 
signaling schemes), and based on that bound, we lower bounded the necessary probing duration T p and 
upper bounded the sum data rate (for both uncoded QAM signaling and random Gaussian signaling). 
Finally, we proposed a practical unbiased ACK/NACK-based SNR estimator and showed that (as the 
probe duration increases) our estimator is asymptotically efficient and asymptotically normal. 
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Appendix A 



Derivation of T™ m for uncoded QAM and Gaussian Signalling 

In this section, we derive (|47l ) and (|48l ). For brevity, we write e = £ p (7, R p ) and e' = £p(7, R p ). Recall 
that, from d25l and d46l ), we have for, uncoded QAM, 

e(l-e)2(a + ln0.1n) 



rpmm 
1 P 



where, from (f2Tb . 
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From d34l and (|46l ), we have for, Gaussian signaling in the low-SNR regime, 
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Appendix B 
Proof of Theorem [2] 

We will directly apply Theorem 2.1 [27, p. 223]. The necessary conditions for asymptotic normality 
and asymptotic efficiency to hold in our system are: 

1) The expectation, E [Ft], of observation F t must exist and must be bounded: 
E [Ft] = e(7, Rt) exists and is clearly bounded by 1 for all t. 



2) The partial derivative 



dE[F t ] 



d-y 



must be jointly continuous (in 7 and Rt) and bounded 

d-, 



9e fr!^ I is continuous and bounded for 



For both QAM (gOj and Gaussian ^29j signals, |^§£ 
7 ^ and 0. 

3) The variance var(F f ) of observation F t must be continuous in 7 and i?t. 

For both QAM and Gaussian signaling, var (Ft) = £(7, Rt)(l— 2(7, i?t)) is continuous and bounded 
for 7 ^ and i? t ^ 0. 

4) Fisher information $(7, must be continuous, positive and for each 7, it must have a unique 
maximum in R t . 

For both QAM and Gaussian signaling, the Fisher information $(7, as given in (|45l ) is contin- 
uous and positive for 7 ^ and R t ^ 0. Moreover, it has a unique maximum R t = R^ me {~f) for 
each 7^0, since $(7, i?t) is a strictly concave and continuous function of Rt- 

5) For some b > 2, E | & ] must be bounded for all possible values of 7 and associated rate i?p enie (7). 
Since F t G {0,1}, we know that E is bounded for all b > 2 and for all values of 
(7,i? 9 p enie (7))- 



Furthermore, the asymptotic efficiency [27, p. 186,224] of the estimator is 

V(7,< me (7))J 
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The asymptotic optimality, i.e., Tncr?, 1- 
consequence of Theorem 2.1 [27, p. 223]. 
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